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The  study  objective  was  to  investigate  the  problems 
of  ship  manning  effectiveness.;  specifically  in  the  main- 
tenance  and  repair  areas,  using  various  probabilistic 
modeling  and  data  analytical  techniques  of  operations 
research.  Maintenance  and  Material  Management  data  from 
the  Maintenance  Data  Collection  System  were  used  for 
estimating  failure  rates,  repair  rates  and  maintenance 
deferral  rates  for  each  type  of  equipment.  These  rates 
were  then  used  as  inputs  to  the  mathematical  models. 

The  models  could  then  predict  system  availability  which 
depends  on  manning  level  and  the  rate  of  repairs  deferred 
for  various  reasons. 
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LIST  OP  TABLES 


MTTR  BY  RATING  GROUPS  OP  THOSE  EQUIPMENTS  REPAIREB 
IN  LESS  THAN  OR  EQUAL  TO  ONE  DAY  DURING  NORMAL 
REPAIR  (IN  MAN-HOURS) 


MTTR  BY  RATING  GROUPS  OP  THOSE  EQUIPMENTS  REPAIRED 
IN  TWO  TO  POUR  DAYS  DURING  NORMAL  REPAIR  (IN 
MAN-HOURS) 


MTTR  BY  RATING  GROUPS  OP  THOSE  EQUIPMENTS  REPAIRED 
IN  PIVE  OR  MORE  DAYS  DURING  NORMAL  REPAIR  (IN 
MAN-HOURS) 


MTTR  BY  RATING  GROUPS  OP  THOSE  EQUIPMENTS  PLACED 
IN  DEPERRAL  REPAIR  (IN  IMN-HOURS) 


MTBP  OP  THOSE  EQUIPMENTS  PAILING  IN  LESS  THAN 
365  DAYS  PROM  THE  TIME  OP  LAST  REPAIR  BY  THE 
RATING  GROUP  OP  THE  REPAIRIilAN  V^TIO  MAINTAINED 
THE  EQUIPMENT  IimiEDIATELY  PRIOR  TO  PAILURE  (IN 
DAYS) 


MTBP  OP  THOSE  EQUIHfflNTS  EITHER  PAILING  OR  EXPE- 
RIENCING REDUCED  CAPABILITY  IN  LESS  THAN  365 
DAYS  PROM  THE  TIME  OP  LAST  REPAIR  BY  THE  RATING 
GROUP  OP  THE  REPAIRMAl^  V/HO  MAINTAINED  THE  EQUIP- 
MENT imiEDIATELY  PRIOR  TO  PAILURE  OR  REDUCED 
CAPABILITY  (IN  DAYS) 


AGGREGATE  MTBP,  MTTR,  AND  PROBABILITY  OP  ENTERING 
NORMAL  REPAIR  (MTBP  AND  MTTR  ARE  IN  DAYS) 
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I.  INTRODUCTION 


This  pro;Ject  was  defined  and  was  under  the  direction 
of  Professor  Donald  P.  Gaver,  Department  of  Operations 
Research  and  Administrative  Sciences,  of  the  Naval  Post- 
graduate School,  in  cooperation  with  Dr,  S.  Sorensen  of 
the  Naval  Personnel  Research  and  Development  Center,  and 
was  sponsored  hy  NPRDC.  The  study  objective  was  to  inves- 
tigate the  problems  of  ship  manning  effectiveness,  specif- 
ically in  the  maintenance  and  repair  areas,  using  various 
probabilistic  modeling  and  data  analytical  techniques  of 
operations  research. 

The  author  undertook  a pilot  study:  to  investigate 

the  availability  of  key  equipment  in  the  Communcetions 
Division  of  the  Operations  Department  on  a “KNOX"  Class 
Destroyer  Escort  Ship, 

Maintenance  and  Material  Management  data  from  the 
Maintenance  Data  Collection  System  (3-M/MDCS)  were  used 
for  estimating  failure  rates,  repair  rates,  and  mainte- 
nance deferral  rates.  These  rates  were  then  used  as  inputs 
to  the  mathematical  models  from  which  equipment  availability 
could  be  predicted,  as  the  latter  depends  upon  maintenance 
personnel  available.  Primary  emphasis  was  placed  on  the 
analysis  of  the  available  relevant  data,  which  could  be 
easily  manipulated  to  yield  meaningful  parameters  as  input 
into  the  model. 
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The  Communications  Division  is  a large  system,  itself 
consisting  of  many  complex  sub-systems  which  fail  and  need 
to  he  repaired.  The  entire  ship,  likewise,  may  he  consid- 
ered to  consist  of  several  systems,  each  contributing  to 
the  readiness  and  availability  of  the  ship  for  combat; 
that  is,  the  ability  of  the  ship  to  maintain  some  Readiness 
Condition  for  an  extended  period  of  time. 

Simplify;  ing  assumptions  are  made  in  order  to  reduce 
the  size  and  complexity  of  the  system:  only  those  sub- 

systems judgementally  deemed  critical  to  maintaining 
mission  capabilities  necessary  for  combat  are  explicitly 
considered  as  failure-prone  and  in  need  of  maintenance, 

A single  carrier  Task  Group  was  selected  to  be  the  operating 
environment,  and  no  other  communication  responsibilities 
are  assumed  to  be  placed  upon  the  ship.  One  h’uidred 
percent  availability  of  the  associated  equipment  periph- 
erals was  assumed;  that  is,  microphones,  patch  cords, 
antennae  tnd  various  other  items,  having  spares  readily 
available  and  low  failure  rates. 
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II.  NATURE  Qg  THE  PROBLEM 


A ship  at  sea  must  he  a self -maintaining  entity  in 
order  to  successfully  perform  an  assigned  mission.  The 
proper  manning  levels  could  have  a significant  impact  on 
whether  or  not  a ship  successfully  completes  the  assigned 
mission.  This  impact  could  vary  with  (a)  equipment  repair 
times,  the  latter  being  related  to  the  experience,  ratings, 
and  numbers  of  men  aboard,  (b)  equipment  failure  rates, 
these  depending  on  personnel  skill  and  training,  assuming 
that  personnel  with  the  proper  Naval  Enlisted  Classifica- 
tion to  do  the  work  were  on  board.  Equipment  availability 
also  (c)  depends  upon  spares  availability,  lack  thereof 
influences  the  rate  of  deferrals. 

Representing  one  component  of  mission  success,  equipment 
availability  in  the  Communications  Division  and  its  effect 
on  mission  success  was  considered,  using  the  criteria 
of  the  above  paragraph.  Manning  levels  and  lists  of 
equipments  were  then  identified,  by  arbitrarily  selecting 
a "KNOX"  Class  Destroyer  Escort  as  a base  case. 

The  USS  DOWNES,  DE  1070,'  was  used  as  a prototypic  sit- 
uation from  v/hich  to  gather  eouipment  loadings  and  author- 
ized personnel  manning  levels.  An  Item  Designation  Report, 
Ref,  1.,  Ship  Equipment  Configuration  Accounting  System, 
(SECAS)  Report  Number  502,1,  was  used  to  determine  the 
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quantity  and  location  of  the  equipments  of  interest. 

Manning  levels  for  the  areas  of  interest  were  determined 
by  reviewing  the  appropriate  BUREAU  OP  NAVAL  PERSONNEL 
REPORT  1080-14.  These  equipment  and  personnel  loading 
factors  were  used  to  provide  a basis  from  which  to  initiate 
the  model.  Charges  in  equipment  and  personnel  can  be 
mde  in  the  model,  so  that  (a)  sensitivity  studies  may  be 
made,  and  (b)  the  model  may  be  applied  to  quite  different 
shipboard  envirounents. 

The  capabilities  that  were  considered  to  be  necessary 
for  combat,  and  the  critical  sub-systems  for  each  specific 
capability  are  identified  by  equipment  name  and  Equipment 
Identification  Code.  These  acronyms  and  names  are  defined 
in  Appeniix  A,  The  Networks  were  as  follows: 

(1)  Network  Number  1,  simplified  to  an  uncovered  UHP 
transceiver  the  AIT/SRC-20  (EIC  - QD3S), 

(2)  Network  Number  2,  simplified  to  consist  of  an  uncovered 
UHP  transceiver  the  AN/SRC-20  (EIC  - QB3R). 

(3)  Network  Niomber  3»  simplified  to  consist  of  a UHP 
transceiver  the  AN/URC-9  (EIC  - QD48). 

(4)  Network  Number  4,  simplified  to  be  a UHP  transceiver 
the  AN/URC-9  (EIC  - QB48). 

(5)  Network  Number  5,  simplified  to  be  the  (KY-8)  and 
an  UHP  transceiver  the  AN/SRC-20  (EIC  - QD3R). 

(6)  Network  Number  6,  simplified  to  be  a KW-7  (EIC  - 
QPIO)  and  a HP  transmitter  the  AN/URT-23  (EIC  - QEIN), 
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The  teletypevnriter  and  the  HP  receiver  were  not  considered 
critical  because  spare  systems  exist  to  take  the  load, 

(7)  Network  Nnmber  7,  simplified  to  an  UHP  transceiver 
the  AN/  SRC-21  (EIC  - QD3S),  KW-7  (EIC  - QPIO). 

(8)  Network  Number  8,  simplified  to  the  KG-14  (EIC  - 
QPOQ)  and  KW-37  (EIC  - QP18)  and  a VLP/MP  receiver  the 
AN/WRR-3B  (EIC  - QBIJ). 


The  above  capabilities  and  equipments  were  selected 
for  attention  as  a result  of  consultations  with  Naval 
Officers  who  had  spent  at  least  one  tour  on  a "KNOX” 

Class  Destroyer  Escort  Ship.  These  simplified  specifi- 
cations were  considered  general  in  nature  and  related  to 
the  Command  and  Control  mission  of  this  class  of  ships. 

The  following  block  diagram  illustrates  a minimiam  path 
representation  of  those  equipmei  that  must  operate  in 
order  for  the  system  to  function.  The  equipments  compris- 
ing each  block  are  listed  below.  Where  more  than,  one 
equipment  is  listed  below  a block,  the  equipments  were 
considered  interchangeable.  Specifically  in  the  first 
block,  6 of  7 means  that  at  least  6 pieces  of  the  7 avail- 
able equipments  must  f-unction  for  the  block  to  function, 
and  the  block  must  be  operating  in  order  to  have  the  system 
operate.  The  failure  of  two  pieces  of  equipment  in  block 
one  would  thus  cause  the  system  to  fail  by  this  definition, 
where  in  fact  only  one  of  the  sub-systems  has  failed. 
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System  failure  by  this  definition  is  the  loss  of  at  least 
one  sub-system. 


Figure  1 

Minlmiam  Path  Representation  of  System  Availability 
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INVESTIGATION  OP  DATA 


An  investigation  of  the  data  supplied  by  Pleet  Mater- 
ial Support  Office  was  undertaken  to  deteraine  whether 
meaningful  repair  rates  and  failure  rates  could  be  extracted 
for  each  type  of  equipment  in  the  model.  The  data  that 
were  used  in  this  analysis  were  from  the  complete  3-M/teCS 
file  from  1970  to  July  1975,  covering  all  ships  reporting 
to  the  3-M  system,  on  the  listed  equipments.  It  became 
immediately  apparent  that  the  time  in  man-hours  to  repair 
any  failed  equipment  was  easily  extracted  from  the  data 
source.  This  was  done  for  all  the  equipment.  On  the 
other  hand,  meaningful  data  on  times  between  failures, ' 
leading  to  estimates  of  equipment  failure  rates  were  not 
as  readily  accessible;  this  difficulty  will  be  addressed 
subsequently. 

One  of  the  initial  objectives  of  this  study  was  i. 
relate  manning  levels  to  maintenance  effectiveness,  and 
thus  to  the  reliability  and  availability  of  the  equipment. 
The  data  concerning  times  to  repair  were  further  analyzed 
to  extract  the  mean  time  to  repair  (MTTR)  for  each  critical 
rating  associated  with  the  Communications  Division;  i.e,, 
ETC,  ETl,  ETN2,  ETN3,  ETSN,  ETR2,  EIR3,  RMC,  RMl,  RM2, 

RM3,  RMSN,  These  twelve  ratings  which  reflect  the  skill 
levels  and  pay  grade  of  the  repairmen  were  then  merged 
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into  ei^t  rating  groups  for  this  analysis.  The  title 

given  each  group  reflects  the  majority  of  the  repair  actions 

undertaken  hy  members  of  that  group.  The  titles  and  members 

are  as  follows:  ETl  - ETC,  ETl 

ETN2  - ETN2 
ETN3  - ETN3,  ETSN 
ETR2  - ETR2 
ETR3  - ETR3 
RMl  - RMC,  RMl 
RM2  - RM2 
RM3  - RM3,  RMSN, 

Individualized  MTTR  parameters  were  computed  over  all 
maintenance  actions  in  terms  of  man-hours  expended  during 
the  repair  action  for  organizational  level  repair  actions. 
Then  data  sorts  of  successively  greater  refin.-;ment  were 
made  in  order  to  produce  more  specific  information.  For 
example,  MTTR  was  computed  for  maintenance  a''+ions  completed 
within  one  day  of  detection  of  failure,  bet\  ' two  and 
four  days,  and  for  greater  than  five  days  aft  r failtire. 

This  study  was  directed  toward  the  on-board  mainte- 
nance actions  that  led  to  a completion  of  rerair.  For 
this  reason,  the  author  chose  to  utilize  only  the  ICDCS 
data  cards  MCE  (Maintemance  Closing  Event)  and  CSMA 
(Completed  Shipboard  Maintenance  Action) , The  CSMA 
record  form  is  filled  out  by  the  organization  actually 
performing  the  repair.  It  is  used  for  organizational 
level  or  what  is  referred  to  as  "normal  repair"  in  this 
thesis;  whereas  the  MCE  is  referred  to  as  "deferral  repairs". 
In  each  case  the  information  recorded  by  the  repairing 
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activity  is  entered  in  a very  detailed  format  so  as  to 
facilitate  data  collection  and  extraction.  A complete 
analysis  of  the  available  deferral  data  was  not  pursued. 
Neither  were  the  Shipboard  Alteration  Actions,  since  they 
were  not  a direct  result  of  an  eq.uipment  failiore,  although 
they  do  contribute  to  workload. 

Estimated  failure  rates,  as  presented  in  the  RMA 
Design  Data  Bank  report,  Ref.  2.,  vary  a great  deal,  depend- 
ing on  the  data  used  for  estimation;  from  equipment  specif- 
ication requirements,  resiilts  of  predictions,  test  results, 
to  measurements  during  fleet  use  of  the  equipments. 

One  of  the  reasons  for  the  variations  is  that  actual  oper- 
ating time  of  the  equipments  in  fleet  use  is  rarely  known 
and  thus  real-time  data  can  not  be  accumulated  except 
under  test  situations. 

For  use  in  a model  we  have  analyzed  the  previously 
described  data  set  (MDGS)  in  order  tj  obtain  meaningful 
times  between  failure  on  the  equipments  of  interest. 

A brief  description  of  some  of  the  complexixies  involved 
in  this  simplified  system  migiit  aid  the  reader  in  \inder- 
standing  the  problems  involved  in  computing  failure  rates 
of  equipments  from  fleet  (3-M/MDCS)  data.  Two  of  the 
eleven  equipments  modeled  were  without  a spare,  three  were 
substitutable  for  each  other  and  the  remaining  equipments 
had  at  least  one  spare  available. 


All  data  were  first  sorted  by  EIC  aad  then  by  UIC. 

Times  between  failure  were  computed  using  several  differ- 
ent methods  in  the  process  of  investigating  the  data. 

The  author  first  computed  the  time  between  successive 
failures  of  like  equipment  aboard  the  same  ship.  This 
was  done  by  chronologically  ordering  the  failure  dates 
on  each  ship  for  each  equipment  and  then  computing  the 
desired  statistics.  This  process  was  done  for  both  failure- 
only  items  and  for  failiires  and  reduced  capability  items. 

The  second  method  chosen  was  to  investigate  each  specific 
equipment  as  identified  by  serial  number  aboard  each  ship. 
The  data  were  sorted  by  EIC,  which  identifies  the  equip- 
ment type,  UIC,  which  identifies  the  ship  or  unit  doing 
the  repair,  serial  number,  and  chronological  ordering  of 
failure  dates  and  were  then  used  to  compute  the  MTBP 
for  each  equipment  by  taking  the  difference  between  the 
failure  date  and  the  (I-l)^^  repair  completion  date 
for  each  set  of  qualifying  records. 

The  next  fourteen  pages  consist  of  statistical  summary 
tables  and  explanation  pages  for  each  table  indicating 
what  the  author  thought  was  important  on  each  table. 

The  following  definitions  are  used  throughout  the  tables: 

MTTR  is  the  Mean  Time  To  Repair  as  computed  from  the 
data 

MTBP  is  the  Mean  Time  Betv/een  Failures  as  computed 
component  by  component 
S.D.  is  the  Sample  Standard  Deviation 

M is  the  Sample  Size 
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MTTR  6.C3  1.59  3.43  4.03  2.66  3.75  1.51  4.26  1.26 
S.D.  10.99  1.68  5.68  6.02  1.61  5.47  3.U  7.19  2.99 
M 12  98  163  69  20  19  443  55  408 


The  entries  in  Table  I.  are  (a)  the  mean  time  to 
repair  each  respective  equipment  by  each  listed  repairman 
rating  along  with  (b)  the  sample  standard  deviation  and 
(o)  the  sample  si25e.  The  data  were  sorted  to  reflect 
those  repairs  taking  place  immediately  by,  first,  only 
looking  at  CSMA  records  and,  second,  using  only  those 
records  which  indicated  repair  was  completed  in  a time 
less  than  or  eq.ual  to  one  day. 

ETN2  and  ETN3  rating  groups  were  involved  in  the  major- 
ity of  all  repair  actions  shown.  The  mean  time  to  repair 
within  each  similar  set  of  ratings  for  each  equipment 
is  essentially  the  same,  when  small  sample  sizes  are  dis- 
regarded. 

The  MTTR  from  this  table  were  used  as  input  for  the 
optimistic  normal  repair  rates  in  the  model  that  is  used 
to  predict  system  availability.  The  MTTR  in  hours  indicated 
in  this  table  could  reflect  an  immediate  diagnosis  of 
the  cause  of  fail-ure,  no  delay  in  getting  parts  and  a 
successful  installation. 
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The  entries  in  Table  II.  represent  the  mean  time  to 
repair  each  respective  equipment  by  each  listed  repairman 
rating  along  with  the  sample  standard  deviation  and  sample 
size.  The  data  were  sorted  to  reflect  only  those  records 
of  equipments  entering  normal  repair  by  only  looking  at 
CSMA  records  and,  second,  using  only  those  records  which 
indicated  repair  was  completed  in  greater  than  or  equal 
to  two  days  and  less  than  or  equal  to  four  days  from  the 
discovery  of  equipment  failure.  The  longer  time  period 
to  complete  repair  of  equipment  in  this  case  could  be 
thought  uf  as  being  the  result  of  a complicated  failure 
which  could  not  be  immediately  diagnosed.  This  is  some- 
what supported  by  the  fact  that  in  almost  every  case  in 
Table  II.  the  repair  times  are  greater  than  in  Table  I,, 
and  in  the  majority  of  cases,  greater  by  fifty  percent 
or  more.  If  the  two  to  four  day  delay  were  solely  attrib- 
utable to  waiting  for  parts,  then  the  man-hours  expended 
should  have  been  more  in  line  with  Table  I.,  since  it  took 
significantly  longer,  one  would  tend  to  believe  the 
initial  premise  of  a complicated  failure  followed  by  a 
more  involved  repair  action. 
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She  entries  in  Tatle  III.  represent  the  mean  time 
to  repair  each  respective  eq.uipment  by  each  listed  repair- 
man rating  alor-er  with  the  sample  standard  deviation  and 
sample  size.  The  data  were  sorted  to  reflect  only  those 
records  of  equipments  entering  normal  repair  by  only 
looking  at  CSI/IA  records  and,  second,  using  only  those 
records  which  indicated  repair  was  completed  in  greater 
than  or  equal  to  five  days  from  discovery  of  equipment 
failure , 

The  repair  times  shown  in  this  table  are  significantly 
greater  in  almost  every  case  than  those  in  Table  I,,  as 
might  be  expected. 

What  is  more  suprising  is  that  the  repair  times  in 
Table  III.  are  generally  less  than  those  in  Table  II. 

This  could  reflect  a delay  of  some  sort  on  less  complex 
jobs  than  those  in  Table  II,,  either  for  parts  or  for 
outside  assistance,  and  most  likely  does  since  all  repair 
actions  take  five  or  more  days  even  though  the  average 
number  of  hours  to  repair  the  equipment  was  less  than 
in  Table  II,  Another  point  of  interest  is  that  a higher 
percentage  of  repairmen  of  the  RT/I  and  ETR  groups  worked 
on  the  equipment  \inder  Table  III,  conditions  than  in 
Table  I.  or  Table  II,  This  could  be  the  result  of  a less 
complex  repair  action  delayed  for  parts. 
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The  main  item  of  importance  to  notice  here  is  that, 
even  though  the  time  to  complete  the  repair  action  was 
greater  than  or  equal  to  five  days  the  MTTR  in  man-hours 
expended  is  still  only  from  one  to  seven  hours,  which 
helps  support  the  tenet  that  a lot  of  time  is  spent  waiting 
for  parts  or  waiting  for  other  specific  conditions  in 
order  to  get  started  with  the  job  at  hand. 
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The  entries  in  Table  IV*  represent  the  mean  time  to 
repair  each  respective  equipment  by  each  listed  repair- 
man along  with  the  sample  standard  deviation  and  sample 
size.  The  data  were  sorted  to  reflect  those  repairs  listed 
as  deferrals  resulting  from  operational  priority,  lack 
of  material,  or  the  necessity  of  obtaining  outside  assis- 
tance. This  sort  was  accomplished  by  looking  at  MCE 
records  and  all  repair  times. 

ETK2  and  ETN3  rating  groups  were  again  involved  in 
the  majority  of  all  repair  actions;  however,  the  ETl, 

ETR2  and  ETR3  rating  groups  number  of  repair  actions  in- 
creased considerably  over  those  listed  in  Table  I.  Again, 
within  broad  rating  groups  the  MTTR  were  essentially  the 
same.  The  MTTR  entries  only  reflect  the  man-hours  required 
to  fix  the  equipment  and  thus  does  not  reflect  the  average 
time  to  complete  repair  of  64  days  for  all  deferral  actions 
taken  as  a class.  This  large  difference  in  time  required 
to  complete  a repair  action  is  thought  to  consist  mostly 
of  waiting  time  for  parts  or  outside  assistance. 
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The  data  used  to  derive  Table  V.  were  sorted  in  such 
a manner  that  only  those  failures  occurring  within  365 
days  of  the  previous  repair  action  were  utilized.  This 
method  tends  to  give  a conservative  estimate  of  the  MTBF. 
Each  equipment  type  was  looked  at  component  by  component 
so  as  to  get  the  most  information  about  failvire  time 
from  the  data;  i.e.,  list  all  the  maintenance  actions  on 
one  piece  of  equipment  in  chronological  order  and  record 
the  time  between  completion  of  last  repair  and  the  next 
failure,  when  the  process  is  out  of  data  on  that  particular 
piece  of  equipment,  start  again  with  the  next  one. 

The  data  used  to  derive  Table  VI,  were  sorted  by 
components  that  either  failed  or  were  listed  in  a reduced 
capability  status  during  a 365  day  time  span  from  the 
completion  of  the  last  repair  of  each  component.  This 
method  in  general  yielded  shorter  MTBP  than  those  looking 
at  failures  alone,  such  as  on  Table  V,  It  should  be  noted 
that  the  sample  for  each  category  in  this  table  includes 
the  complete  sample  used  in  Table  V. 
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TABLE  VII 


Table  VII.  is  the  most  general  table  of  all,  and  could 
quite  possibly  contain  the  most  enlightening  information 
also.  Rows  one  and  two  are  straightforward  MTBF  in  days. 
Row  three  entries  are  the  mean  time  to  complete  a repair 
of  a fail\ure  for  all  actions  listed  as  organizational 
level  and  ready  for  immediate  repair,  A range  of  three 
to  thirteen  days  makes  it  rather  hard  to  believe  the 
often  quoted  MTTR  in  terms  of  hovirs  as  indicated  in 
Tables  I - IV,  It  should  also  be  pointed  out  that  over 
75?S  of  the  observations  were  actually  less  than  each  of 
the  respective  MTTR  computed  which  is  also  indicated  by 
the  large  standard  deviations  compared  to  the  location 
of  the  mean. 

If  times  to  repair  were  exponentially  distributed, 
one  would  expect  1 - e -UMTTR)(MIIR)  ^ 0,53  or 
of  the  observations  to  be  less  than  the  MTTR,  this  along 
with  the  larger  standard  deviations  lend  support  to 
saying  that  repair  times  tend  to  be  “hyper  exponential’' , 
having  very  long  tailed  distributions. 

Similar  reasoning  may  be  applied  to  row  five,  which 
contains  both  failures  and  reduced  cabability  records. 

Rows  four  and  six  depict  deferral  type  maintenance  infor~ 
mation  on  the  MTTR  for  each  respective  equipment.  One 
cotild  infer  that  these  long  MTTR  in  days  were  the  result 
of  waiting  for  parts  or  outside  assistance. 
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The  probability  of  entering  normal  repair  as  listed 
in  row  seven  was  computed  by  summing  over  all  maintenance 
actions  of  each  eq_uipment  type  (different  EIC),  the  total 
ntimber  of  MCE  (deferral  repair  actions)  and  the  total 
number.'of  CSM  (nomal  repair  actions).  The  entry  in 
row  seven  was  then  computed  to  be  CSItIA/(CSMA  + MCE). 
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IV.  REPAIRMAN  TYPE  MOISEL 


Individual  pieces  of  equipment  can  fail  and  either 
enter  normal  repair  or  enter  deferral  repair  with  some 
specified  prohahility  for  each  type  of  equipment.  Equip- 
ment failures  are  repaired  on  a First-Come  - First-Served 
basis.  If  a failure  occurs  and  is  not  repaired,  the  system 
is  subject  to  failure.  When  enough  failures  occur  so 
as  to  saturate  the  available  number  of  repairmen,  a repair 
queue  begins  to  build  up  of  those  equipments  waiting  to 
be  repaired.  Equipments  which  can  not  be  repaired  due  to 
lack  of  material  (parts)  or  lack  of  technical  expertise 
are  placed  in  a deferral  repair  status  and  generally 
experience  an  extremely  long  delay  before  the  situation 
can  be  remedied. 

The  deferral  repair  time  distributions  from  Table 
VII.  could  be  used  as  parameters  not  really  dependent 
on  the  number  of  repairmen  available  and  thus  can  be  con- 
sidered in  service  concurrently  with  the  normal  repair 
actions.  Thus  a generalized  view  of  this  model  is  that 
when  equipment  fails  it  is  either  repaired  quickly  or  is 
delayed  for  some  reason.  The  probability  of  being  in 
any  particular  situation  is  an  output  of  the  model  developed 
here. 


This  model  is  based  upon  a simplification  of  the  main- 
tenance process  and  is  desigr.ed  to  utilize  the  results  of 
the  data  investigation  of  Chapter  III,  or  any  other  varia- 
tion a user  might  desire*  One,  two  or  three  repairmen 
may  be  specified  to  work  on  any  number  of  equipments. 

The  model,  is  more  realistic  than  other  repairman  type 
models  in  that  it  allows  for  individualized  failure  rates 
and  repair  rates  for  each  type  of  equipment.  Different 
deferral  probabilities  for  each  equipment  are  also  allowed. 
These  additional  features  of  the  model,  v/hile  increasing 
the  realism  of  the  problem,  also  increase  the  complexity 
of  the  solution  as  indicated  by  Caver  in  Ref.  3. 

Incorporating  manpower  data  and  equipment  reliability 
data  into  a model  was  predicated  on  considerations  in 
three  major  areas: 

(1.)  availability  of  sufficient  and  meaningful  data, 

(2.)  ability  to  vary  factors  of  interest  to  Havy 
planners, 

(3.)  measures  of  system  performance. 

Availability  of  data  has  been  previously  discussed 
in  Chapter  III.  Factors  of  interest  to  the  Navy  planners 
were  manning  levels,  manning  effectiveness  and  equipment 
reliability  and  availability.  The  effect  of  changed 
overall  manning  levels  on  mission  success  can.  be  studied 
by  using  differing  numbers  of  repairmen  in  the  model. 


The  primary  measure  of  system  performance  was  taken  to 
he  the  probability  of  mission  success,  or  one  minus  the 
probability  of  mission  failixre. 

A.  INPUT  ASSUI/IPTIONS 

The  individual  times  between  failures,  times  to 
complete  repairs,  and  times  to  complete  deferred  repairs 
are  assumed  to  be  independently  and  exponentially  distrib- 
uted. This  assumption  seems  credible  as  indicated  in 
Chapter  III,  where  the  number  of  observations  examined 
was  greater  than  three  to  four  hundred.  Basic  parameters 
appearing  in  the  models  are  the  MTBP  for  equipments  as 
well  as  MTTR  (normal,  and  deferred). 

Each  equipment  fails  at  a different  but  constant  rate 
(i=l,  2, . ,N) , and  can  be  repaired  at  the  normal 
(immediate)  repair  rate  (i=l,2, . . • ,N)  or  at  the  deferral 
repair  rate  (i=l,2, , . .,N) . Each  equipment  is  assumed 
to  go  into  immediate  repair  with  probability  p^  or  deferral 
repair  with  probability  qj_=l-p^  (i=l,2, , , . ,N) . These 
rates  and  probabilities  were  determined  from  the  data  of 
Chapter  III. 

Deferral  repair  action  MTTR  over  all  equipment  types 
ranged  from  54  to  92  days  to  repair  a failure.  This 
long  time  was  thought  to  be  a result  of  waiting  for  parts 
or  outside  assistance.  The  MTTR  computed  here  was  so 
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much  greater  than  the  MTTR  for  normal  repairs  (10  to  60 
times  longer)  that  the  author  decided  to  use  the  MTTR 
for  deferrals  as  a repair  time  not  influenced  hy  actual 
repairman  time.  In  other  words,  deferral  repairs  could 
he  going  on  simxiltaneously  with  normal  repairs  even  though 
all  the  repairmen  v/ere  busy.  This  is  due  to  the  long  wait- 
ing time  involved  in  a deferral  action  relative  to  the 
actual  time  required  to  fix  a piece  of  equipment, 

B.  SYSTEM  STATES 

formally,  order  of  failure  is  important  in  determining 
when  an  equipment  begins  to  get  repaired.  This  has  not 
changed  in  this  model,  the  first  item  to  fail  and  enter 
into  the  immediate  repair  queue  is  serviced  first.  In 
practice  a priority  scheme  might  be  followed. 

1.  Finite  Capacity  Asstunption 

Three  failures  of  any  type  are  all  that  are  allowed. 
This  restriction  can  be  changed  by  incorporating  additional 
echlons  of  state  spaces  into  the  model.  Allowing  only 
three  simultaneous  failures  was  considered  adequate, 
since  only  eleven  or  twelve  pieces  of  equipment  v/ere  modeled 
as  a sub-system. 

2.  Labeling  of  States 

The  order  in  which  failures  occur  define  the 
label  of  each  possible  state  for  the  system  to  be  in. 


r 
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The  states  of  this  model  may  he  identified  according  to 
the  following  format: 

N is  the  numher  of  different  equipments  labeled  from 

1 to  N. 

i^jA  (i,d,k=l,2,...,N) 

(0)  no  equipment  failures 

(1)  equipment  numher  i has  failed  and  is  in 
normal  repair 

(ih)  equipment  numher  i has  failed  and  is  in 

deferral  repair 

(i»d)  equipment  numher  i has  failed  first  and  is 
in  normal  repair,  equipment  nimiher  has 
failed  second  and  is  in  normal  repair 

(i,jD)  same  as  (i,j)  with  j in  deferral  repair 

(il),j)  same  as  (i,j)  with  i in  deferral  repair 

(iD,3D)  same  as  (iD,j)  with  j in  deferral  repair 

(i,j,k)  same  as  (i,o)  v/ith  equipment  numher  k has 
failed  third  and  is  in  the  normal  repair 
queue 

(i,j,lcD)  same  as  (i,j,k)  with  k in  the  deferral 
repair  queue 

(ih,3D,k)  same  as  (i,j,k)  v;ith  i and  ^ in  the  deferral 
repair  queue 

(iD,jD,kD)  same  as  (i,3,k)  with  i,  3,  and  k all  in 
the  deferral  repair  queue 

State  (3,4-D,7)  would  mean  that  equipment  numher  3 
had  failed  first  and  is  in  the  normal  repair  queue,  equip- 
ment numher  4 had  failed  second  and  is  in  the  deferral 
repair  queue,  and  equipment  numher  7 had  failed  third 
and  is  in  the  normal  repair  queue. 


Each  piece  of  equipment  to  be  modeled  is  assigned  a 
unique  equipment  number  and  thus  if  there  are  several 
piecey  of  the  same  equipment,  each  has  a unique  number 
even  though  the  failure  and  repair  rates  are  identical 
for  each  equipment. 


C.  BALANCE  EQUATIONS 

The  balance  equations  equate  the  rate  at  v;hich  the 

system  enters  a certain  state  to  the  rate  at  which  the 

system  leaves  that  state.  Now  using  the  previously  defined 
states  and  parameters,  the  following  balance  equations 
for  two  repairmen  are  written  as  a selection  from  the 
entire  system.  Where,  as  an  example,  P , means  the 
probability  of  being  in  state  (jli,i).  The  total  number 
of  states  is  equal  to  1+2(N)+4(N) (N-l)+8(N) (N-1) (N-2) 

for.  this  model. 

K N N 

( P.)  P = R>P-  + RD-  P- 

'1=1  1''  0 1=1  11  1=1  1 1 


'l=l  1''  0 1=1  11  1=1  1 1 

^■^i  ^i  = Vi^i  j^i  ^/o,i 


This  is  to  say  that  the  only  way  to  get  into  state 
(i)  is  to  be  in  state  (o)  and  have  equipment  i fail  and 
go  into  normal  repair  or  to  be  in  state  (;j,i)  or  (jL,i) 
and  have  equipment  j repaired.  The  only  v;ay  to  leave  state 
(i)  is  to  either  have  equipment  i fixed  or  to  have  a failure 
of  another  equipment. 


■ saei-rm'rri-r'- 


N 


(R^  + ED^  + I’l^)  ^’i,j-D  = k;^i^d  Vk,i,;jD  ■** 


N 


*^k"kD,i,jD 


KD,.Pr 


(Rj_  + R^.  + 0) 


Only  allov/ing  three  failure  yields  the  above  eq.uation, 
since  there  are  only  two  repairmen. 


(RJ)^  + + ^k^  ^iD,OD,kD 


“ ^iD,jD'^k^\ 


Here  it  is  noted  that  three  deferrals  are  being  ser- 
viced even  though  there  are  only  two  repainaen.  This  is 
due  to  the  assumption  about  deferral  repair  actions, 

D,  SOLUTION  TECHNIQUE 

The  Gauss-Seidel  iterative  approach  to  the  steady 
state  solution  of  a system  of  balance  eq.uations  was  iised. 
Briefly,  this  approach  is  as  follows: 
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1*  Initiate  Starting  State  Pro~bal)ili1;ie3 
■ . The  most  convenient  assignment  method  is  to  use 
an  equally  likely  distribution  and  assign  each  the  value 
1.0/(total  number  of  states)* 

2.  Turn  the  Crank 

Using  the  present  values  for  each  state,  solve 
for  pQ  and  use  the  nev/  value  for  P^  when  solving  any 
additional  equations  containing  P^.  This  same  method  is 
then  used  for  P^  and  all  of  the  other  possible  states. 

Continue  until  a nev/  probability  has  been  computed  for 
each  state, 

3.  Normalization 

Take  the  sum  of  all  of  the  new  probabilities  ;just 
computed  and  divide  it  into  each  of  the  new  probabilities, 
thus  'ensuring  that  all  probabilities  add  to  tinity,  as 
they  must , 

4 . Check  for  Stopping  Criteria  Satisfaction;  Re-itex''ation 

The  author  chose  to  use  a relative  ratio  method 

to  test  for  a minimal  change  in  probabilities,  since  many 
of  the  probabilities  would  be  extremely  small  and  would 
thus  always  pass  a simple  differencing  technique.  The 
method  v/as  to  take  the  absolute  value  of  the  difference 
between  the  old  probability  and  the  newly  computed  proba- 
bility and  divide  by  the  nev/  probability.  (|  OLD-NEW  | /NEW) 
This  was  done  for  each  state  and  checked  to  see  if  the 
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resiilt  wan  greater  than  0.0001.  If  so,  for  at  least 
one  state,  the  iterative  process  must  continue  until 
all  states  pass  the  stopping  criteria;  i.e,,  GO  TO  2 
and  start  again. 


V.  RESULTS 

The  model  was  programmed  in  Fortran  G and  was  executed 
to  generate  the  results  listed  here.  The  computer  listing 
follows  Appendix  A.  The  output  of  this  model  comes  in 
the  form  of  long  run  probabilities  of  being  in  any  partic- 
ular state.  Selected  sorting  and  summing  procedures 
are  then  used  to  produce  the  probability  that  any  set 
of  eq.uipments  are  down,  and  thus  it  is  possible  to  ca‘'culate 
the  probability  that  the  system  is  down.  This  was  done 
for  each  of  the  sets  of  e<iuipment  described  in  Figure 
2 and  then  the  joint  probability  was  computed  and  used 
in  the  results.  As  an  example  of  the  size  of  the  sorting 
and  summing  task:  using  R=12  equipment  yields  11,113 

different  possible  states  which  are  then  easily  sorted 
and  summed  on  a computer  by  user  supplied  logic  statements 
specific  to  the  information  desired. 

The  author  chose  to  present  two  of  the  multitude  of 
X30ssible  outcomes  in  the  Results  section. 
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A.  SPECIFIC  RESULTS 

1»  Optimistic 

Using  the  normal  repair  times  (Table  I.),  proba- 
bilities of  entering  normal  repair  (Table  VII,),  and  the 
deferral  repair  times  for  failures  (Table  VII.)  resulted 
in  the  following: 

Probability  that  all  equipment  are  up  = 0,4220 

Probability  of  system  unavailability  = 0.1185 

Where  system  down  means  that  at  least  one  sub- 
system or  mission  capability  can  no  longer  fimction  due 
to  equipment  failure.  It  should  be  pointed  out  that  the 
other  seven  sub-systems  could  quite  possibly  be  available, 

2.  Pessimistic 

Using  the  normal  repair  times  for  failures  (Table 
VII,  row  3),  and  keeping  all  other  parameters  the  same 
as  the  optimistic  case. 

Probability  that  all  equipment  are  up  = 0.0988 

Probability  of  system  ixnavailability  = 0,3086 

3.  More  Realistic 

A weighted  average  of  times  to  repair  each  equip- 
ment was  used  to  compute  the  MTTR  used  as  input  for  this 
calculation.  The  method  of  weighting  was  as  follows: 

(1)  sort  the  data  to  reflect  only  normal  repair  records; 

(2)  extract  a frequency  distribution  of  the  number  of 
days  to  repair  a failed  equipment; 
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(3)  use  this  distribution  to  compute  the  weightings 
for  each  of  the  equipments.  The  time-to-repair  factor 
for  the  one-day-or-less  category  was  the  time  in  hoiirs 
from  Table  I.  times  the  percent  of  normal  repair  actions 
occurring  in  one  day  or  less.  The  time  to  repair  factor 
for  the  two~to-four  day  category  was  4-8  hours  times  the 
percent  of  normal  repair  actions  in  that  category.  The 
time  to  repair  factor  used  for  the  five-or~moro  days 
category  was  the  MTTR  for  failure-only  undergoing  normal 
repair  from  Table  VII.  Since  the  MTTR  was  listed  in 
days,  it  was  converted  to  hours  and  multiplied  by  the 
percent  of  normal  repairs  occ-iorring  in  five  or  more  days. 
Generally , the  percentages  for  each  category  were  about: 
65^  in  one  or  less  days;  10^  in  two  to  four  days;  and 
in  five  or  more  days.  The  results  of  using  this 
input  was  as  one  would  expect,  in-between  the  optimistic 
and  pessimistic  cases. 

Probability  that  all  equipment  are  up  = ,3025 
Probability  of  system  unavailability  = .1543 

B.  RESULTS  OP  VARYING  NORMAL  REPAIR  TIMES 

The  following  graphs  were  made  to  show  the  complete 
range  of  good  to  bad  repair  times  and  their  effect  on 
mission  failure. 
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Figure  2,  presents  iDoth  e<iuipiiient  sets  using  1,2 
and  3 repairmen,  over  the  full  range  of  optimistic  to 
pessimistic  values  for  the  normal  MTTR,  There  appears 
to  he  very  little  difference  in  results  between  using 
two  or  three  repairmen  on  Equipment  Set  1 and  thus  two 
repairmen  would  be  preferred.  There  was  essentially  no 
difference  in  results  between  the  differing  number  of 
repairmen  on  Equipment  Set  2 and  thus  one  repairman  would 
be  preferred*  The  realistic  estimates  for  these  sets 
of  equipment  varied  from;  a MTTR  of  2.75  days  and  a system 
down  probability  of  0.08  for  1 repairman  to  0.07  for  2 
or  3 repairmen  on  Equipment  Set  1,  to  a MTTR  of  0.88 
days  and  a probability  of  the  system  being  down  of  0.09 
for  1,2  or  3 repairmen.  It  was  noted  that  1 repairman 
on  Set  2 produced  very  slightly  better  results  than  two 
or  three  repairmen;  this  was  considered  to  be  a result 
of  computer  roundoff  error  since  there  was  essentially 
no  difference  between  them. 


VI.  CONCLUSIONS 


UsefilL  failure  rate  and  repair  time  parametera  can 
be  extracted  from  3-M  data.  Simplified  systems  of  critical 
equipments  and  the  associated  repairmen  can  be  modeled 
to  estimate  a specified  mission  probability  of  sucess. 

The  long  time  to  complete  deferral  repairs  are  basic- 
ally the  limiting  factors  in  computing  mission  success 
or  failure;  i.e.,  the  probability  that  the  system  is 
down  reaches  a limiting  value  does  not  decrease  no  matter 
how  fast  an  item  can  be  fixed  in  normal  repair.  This 
would  seem  to  imply  that  in  order  to  get  more  equipment 
repaired  faster,  the  deferral  repair  times  need  to  be 
reduced. 

This  model  can  be  expanded  to  look  at  cases  allowing 
more  than  three  failtire  quite  easily,  the  only  note  of 
caution  is  that  as  additional  failures  are  included  the 
number  of  possible  states  increases  too,  which  may  at 
some  point  become  too  unv/sidly  to  handle. 

It  is  the  author’s  recommendation  that  further  investi- 
gation into  simplification  modeling  of  systems  be  looked 
at  in  other  areas  in  an  effort  to  reduce  modeling  costs 
without  sacrificing  the  desired  goals  of  the  Navy  planners. 
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APPENDIX 


A 


List 

CSMA 

EIC 

HP 

MCE 

MDGS 

MP 

RCVR 

UHP 

UIG 

VLP 

XCVR 

XMTR 


EIO 

QBIJ 

QB3A 

QD3R 

QD3S 

QD48 

QEIN 

QPOQ 

QPIK 

QPIO 

QPIB 

Q33K 


of  Acronyms,  Abbreviations  and  Eq,uipments 

Completed  Shipboard  Maintenance  Action 
(Normal  Repair  Form) 

Equipment  Identification  Code 
High  Frequency- 

Maintenance  Closing  Event  (Deferral 
Repair  Form) 

Maintenance  Data  Collection  System 

Medium  Frequency 

receiver 

Ultra  High  Frequency 
Unit  Identification  Code 
Very  Low  Frequency 
transceiver 
transmitter 


Equipment  Name 

AN/vmR-3B 

R-1051 

AN/SRC-20 

AN/SRC-21 

AN/URC-9 

AN/URT-23 

KG-14 

KY-8 

KW-7 

KW-37 

AN/UGC-20 


Function 


VLF/r/IF  RCVR 
MF/HF  RCVR 
UHF  XCVR 
UHF  XCVR 
UHP  XCVR 
HP  XMTR 
SECURITY 
SECURITY 
SECURITY 
SECURITY 
Teletypewriter 
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